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Abstract —Recent communication, computation, and technol¬ 
ogy advances coupled with climate change concerns have trans¬ 
formed the near future prospects of electricity transmission, and, 
more notably, distribution systems and microgrids. Distributed 
resources (wind and solar generation, combined heat and power) 
and flexible loads (storage, computing, EV, HVAC) make it 
imperative to increase investment and improve operational ef¬ 
ficiency. Commercial and residential buildings, being the largest 
energy consumption group among flexible loads in microgrids, 
have the largest potential and flexibility to provide demand 
side management. Recent advances in networked systems and 
the anticipated breakthroughs of the Internet of Things will 
enable significant advances in demand response capabilities of 
intelligent load network of power-consuming devices such as 
HVAC components, water heaters, and buildings. In this paper, a 
new operating framework, called packetized direct load control 
(PDLC), is proposed based on the notion of quantization of 
energy demand. This control protocol is built on top of two 
communication protocols that carry either complete or binary 
information regarding the operation status of the appliances. We 
discuss the optimal demand side operation for both protocols and 
analytically derive the performance differences between the pro¬ 
tocols. We propose an optimal reservation strategy for traditional 
and renewable energy for the PDLC in both day-ahead and real 
time markets. In the end we discuss the fundamental trade-off 
between achieving controllability and endowing flexibility. 

Index Terms —Mlcrogrld; direct load control; energy quanti¬ 
zation; electricity market; energy procurement 


I. Introduction; Energy Evolution 

The traditional power system is under significant evolution 
to the smart grid that is driven by technology, economics, 
as well as policy needs Cl. Technology advances provide 
the foundation that enables the full realization of the smart 
grid. One such advancement is the prevailing installation of 
distributed generation (DG) and renewable energy resources 
all around the world. The European Union is the leader in 
developing and deploying renewable generation to reduce the 
dependence on imported energy and greenhouse gas emissions. 
The EU has set a goal to reach 20% share of renewable 
energies in gross energy consumption by 2020. In United 
States, 38 states have long term renewable portfolio standards 
and 14 states have installed more than 1,000 MW of wind 
power m. It is expected that DG and renewable deployment 
will have an annual growth of 2.5% per year until 2040 0. 
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Parallel to the development of generation technologies is the 
profound structural changes in demand. Traditional passive 
demands are becoming intelligent and an increasing number 
of end users are participating in demand response programs 
that enhance grid reliability in presence of uncertain events 
such as failure of generators or transmission lines, renewable 
intermittency, etc. An extra level of reliability is guaranteed 
by the development of storage elements that are installed at 
the microgrid level to store and release energy in case of 
emergencies. 

The information aspects of the power system, i.e. sensing, 
communication, and algorithmic approaches, are the key soft 
elements for improving grid intelligence. Wide-Area Measure¬ 
ment Systems (WAMS), also known as synchrophasors, are 
experiencing a tremendous era in the past decade. The Phasor 
Measurement Unit, known as the PMU, is broadly deployed in 
the transmission network to measure 60 Hz waveforms at fine 
resolution. Based on the concept of a phasor as introduced 
by Charles Proteus Steinmetz in 1893, ||4], modern PMU’s 
can provide synchronized data by using clock signals from 
GPS. Thus high-resolution measurements of real and reactive 
power flows taken at widely separated points in the grid can be 
compared in real time to provide high accuracy security assess¬ 
ment and disturbance localization a, 0 . At the distribution 
and microgrid level, advanced metering infrastructure (AMI) 
integrates sensor, appliances, metering and data management 
systems to enable the information exchange between utilities 
and end users. These infrastructures are typically two-layer 
designed where the first layer is the wired or wireless com¬ 
munication established between meters and appliances, and 
the second layer is the Internet based communication between 
utilities and users. 

Generation advancement, demand flexibility, and informa¬ 
tion technologies together are shaping the traditional power 
system, whose operation has been centralized for the past 
century. The success of the smart grid will depend heavily 
on the operation of distributed microgrids in which smart 
buildings are the most energy consuming parts, but at the 
same time having the highest potential and flexibility to 
manage. In this paper, our foremost focus is on the optimal 
operation of smart buildings assisted by various control and 
communication technologies. Section|^presents a brief review 
of the development and operation of microgrids. Section 
[In] introduces the most important flexible resource of the 
microgrid - smart buildings. We present a brief literature 
review that addresses recent research on modelling, control, 
and energy efficiency aspects of the smart buildings. Inspired 
by the notion of packet switched communication system. 
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we introduce a new operating concept, called the energy 
packet, to the existing direct load control (DLC) frameworks. 
The corresponding framework called packetized direct load 
control (PDLC) is discussed in Section [IV] The rest of the 
paper elaborates the performance opportunities of PDLC under 
two communication and control scenarios. The first scenario 
provides a baseline where the PDLC can smooth demand 
uncertainty and guarantee consumers’ energy needs based on 
real time full state information. The second scenario consid¬ 
ers constrained information passing when only binary state 
information is transmitted. We characterize the performance 
with these protocols in terms of both energy and monetary 
metrics in Section |V1| The impact of volatile resources to 
the system performance is discussed in Section VII Section 


VIII discusses the optimal energy portfolio procurement for 


both traditional and volatile resources. Section m addresses 
the control and market aspects of binary information PDLC. 
Conclusion and future directions are drawn in Section |2 


11. Smart Microgrids 

The concept of microgrid is markedly different from te 
components of a the traditional power system composed of 
numerous generation units, transmission/distribution lines, and 
loads. Shown in Fig. a microgrid is a localized distribution 
system composed of distributed generators, flexible loads, 
and energy storage elements that are all networked through 
advanced communication techniques and are controlled by the 
microgrid operator who is responsible for providing reliable 
and secure electricity service. Drivers of the growth of mi¬ 
crogrids include regulation incentives, consumers’ needs, and 
operation costs. 


Distributed 

Generation 



Fig. 1. A typical microgrid is comprised of smart buildings, distributed 
and renewable generations, energy storage elements, and a central microgrid 
controller. 


The development of microgrids can facilitate the adoption 
of renewable resources 0, a. This is consistent with the 
regulatory goal of the US Government that an increasing 
proportion of renewable energy must be used to serve energy 
demand ||9|, ifTOl . For example, California requires that all sell¬ 
ers of electricity must serve their load with 33% of electricity 


generated by wind by 2020. Massachusetts has committed to 
installed 2,000 MW of wind generation before 2020 as part 
of its plan to reduce greenhouse gas emissions. Microgrids 
can also enhance the resilience of the power system in order 
to withstand both physical and cyber attacks CD, d, Cl. 
When an major disturbance occurs, such as super-storms 
or floods, the localized microgrid can actively island itself 
from the main grid to operate using its distributed generators 
and energy storage elements. The detach of microgrid from 
the main system will also prevent the occurrence of large 
scale cascaded failure. In addition, microgrid operation is 
facilitated by a few advanced information technologies that 
are ready to use. The advent of Internet of Things allows 
the networked connectivity between the operator and the 
flexible loads such that communication signals are exchanged 
and control signals are executed M, ca, El. Metering 
technologies together with data analytics enable the operator 
to gain better understanding of consumers preferences and 
behaviours. In addition, various software have been developed 
to maximize the economic use of resources and to minimize 
the costs of consuming electricity. 

Parallel to the potential benefits that the microgrid will bring 
to power systems, research has been conducted to optimize the 
operation of microgrids such that maximum economic benefit 
can indeed be achieved. Renewable resources, whose genera¬ 
tion outputs largely depend on the stochastic environments, 
need to be coordinated and controlled. One solution is to 
build hybrid power systems (HPS) such that the controlled 
system can enable the exchange between different power 
units ini, Qa, m. The concept of virtual power plant 
(VPP) was proposed as a cluster of distributed generation 
installations that is controlled by a central authority Go), ED, 
ED- For example, generation sources with different dynamic 
response rates can be combined to achieve a higher operational 
efficiency that cannot be achieved by any of the sources used 
individually ll23l . Il24ll . HPS can not only stabilize its output 
to have minimum oscillation, but it can be built to provide 
ancillary service. In contrast to generation management is 
flexible loads management that is optimized to utilize energy 
at the right time with the right price. Flexible demands can 
be classified according to their operational requirements. Non- 
interruptible loads, such as washing machines or dishwashers, 
have the lowest level of flexibility that cannot be interrupted 
in operation, whereas interruptible loads provide more degrees 
of freedom as long as consumer specified requirements are 
guaranteed to be met. Intra-microgrid load coordination can be 
extended to inter-microgrids coordination within the vicinity of 
the same distribution network. When each microgrid operator 
can extract the aggregated demand preferences represent¬ 
ing the overall utility function, distributed or decentralized 
algorithms can be applied among microgrid operators and 
the regional transmission operators to reduce the costs of 
consuming electricity, the costs of transmission congestion, 
and the need for operating reserves ED, Ei, ED- 

HI. Smart Buildings 

Smart buildings, both residential and commercial, are the 
most important elements in the microgrids that account for 
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a surprising 40% of total energy demand. The objective of 
building operation is to improve the reliability, sustainability, 
and efficiency of electricity usage while guaranteeing the 
comfort requirements of the building occupants. Thanks to the 
fast paced development of networked control technologies ||28]| 
and novel concepts enabled by smart appliances, the control 
of a group of appliances has become a reality that enables the 
building to become smart. Advanced buildings consumption 
analytics are provided by various companies, like OPower, 
C3 Energy, FirstFuel, to assist consumers in learning their 
energy consumption characteristics and help to improve energy 
efficiency. Companies like Honeywell, ABB, and Nest pro¬ 
duce smart interfaces that have IP addresses to be controlled 
remotely from the Internet. The controlled appliances can be 
deployed in demand response programs that are executed to 
improve the economics of energy usage. The bar chart below 
shows the percentage of electricity consumption from different 
appliances in both residential and commercial buildings. It 
can be seen that loads with energy storage, such as HVAC, 
refrigeration, consume nearly 50% of the overall consumption. 
Lighting in commercial buildings also provides big opportu¬ 
nity to affect the overall consumption if it can be operated 
at multiple dimming modes. In addition, non-interruptible but 
delay-able loads, such as dryers or washer, can contribute to 
the flexible shifting of aggregated consumption. 


0.45 

0.4 

0.35 

0.3 

St 

S 0.25 

S 0.2 
£ 

0.15 

0.1 

0.05 

0 


Fig. 2. Bar chart of electricity consumption in both commercial and 
residential buildings. Flexible loads, such as HVAC, refrigeration, and lighting, 
consume more than 50% of the aggregated consumption. 

The first step towards smart building management is to build 
accurate models to describe the dynamics of the states that 
are controlled by the appliances. Early work focusing on the 
modelling of individual and aggregated HVAC appliance was 
proposed in ll29l . BOl . OTIl . The thermodynamic model of the 
room temperature, which is a complex function determined by 
the building construction, thermal sources, and meteorological 
inputs, can be reduced to a linear differential equation with a 
few parameters containing the effective thermal time constant, 
temperature gain of appliances, etc. all of which are estimated 
by designing a cold load pick-up process. Following the idea of 
duty cycle modelling, controlled HVAC aggregated modelling 
in demand side management has been studied extensively in 
the past decade. The change of load diversity and synchroniza¬ 
tion was analysed in the case that direct load control (DEC) is 
used in demand reduction to shift peak load to non-peak hours 



ll32]| . II 33 I . II 34 II . 051 . When aggregated HVAC appliances are 
used to provide fast ramping regulation reserves, the load 
diversity will be affected by the past DEC profiles and price 
signals. This results in even more complex representation of 
the diversity 0^ . OTl . OSl . O^ . which is then characterized 
by a few statistical parameters that can be used as the state 
variable representing the aggregation. 

Upon understanding the dynamical model of appliances, 
control architectures can be applied to provide demand re¬ 
duction with minimum users’ comfort degradation iOl, El- 
Ancillary service can also be enabled to reduce the need for 
secondary reserve in the presence of renewables BTIl . Il42ll . For 
example, the thermal dynamic model provides capacity bounds 
on the amount of regulation reserve that smart buildings can 
provide ll42l . Il43l . With a reasonable level of reserve capacity, 
the building operator can design either actuator based DEC or 
price based indirect load control to modulate the aggregated 
consumption ll44l . ll45l . Bril . BtI . ll4^ . If this was done with 
inaccurate thermal modeling and capacity overestimation, it 
could only achieve 50% to 60% of current demand response 
pilot programs ll49l . We formulate an optimization problem 
that can jointly minimize the summed costs of providing 
regulation reserve along with a consumer’s disutility. The 
formulation is either based on dynamic system models or 
Markov decision process. Another source of flexibility is pro¬ 
vided by interruptible loads that are deadline constrained, e.g. 
electric vehicles, washing machines. For example, large groups 
of electric vehicles can be coordinated to fill the overnight 
demand valley, to mitigate renewable energy intermittency, and 
to reduce transmission congestion EHl, ED, EH, ES, El, 

ES, El. 

In addition to demand response and the grid-wide benefits 
of ancillary services, smart building micro-grids have been 
identified as potential high-value resources for energy con¬ 
servation and sustainability. There has been a standardization 
of an energy efficiency index that relates a smart buildings 
consumption to a reference level of consumption E3, El, 
ll59l . Measurements of physical characteristics of consumption 
can be compared with estimates of activity levels of building 
occupants to support this index. Energy savings based on 
direct sensing of room occupancy and task-specific lighting, 
when combined with advances in bulb efficiency, can provide 
significant reductions in consumption, on the order of 44% to 
50% according to some reports E3, ESI, ED- 

IV. Energy Packets and PDLC 

The idea of an energy packet and its associated control 
protocols are motivated by the notion of data packets and 
packet switched communication networks. We see a lot of 
structural and functional similarities between communication 
and power systems - they both have networked topology in 
which resources should be delivered with minimum disruption. 
Packet switched communication has shown its advantage in 
improving system fairness and network efficiency with limited 
transmission capacity. This is also a critical issue in power 
systems, namely how we can improve the energy quality of 
service to guarantee minimum energy disruption in demand 
response given a prescribed level of energy capacity. 
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In this section we introduce energy packets, as well as the 
associated control and communication protocols built on them 
that we call packetized direct load control (PDLC). Differing 
from traditional energy distribution where appliances consume 
electricity according to their duty cycles, in PDLC appliances 
will consume quantized energy in the form of energy packets 
defined as follows: 

Definition 1. An energy packet for a given appliance is a 
fixed time interval 5 during which electricity is consumed 
at the appliance’s rated power with its nominal voltage and 
current. 



Fig. 3. Continuous electricity consumption in the operating cycle is quantized 
into energy packets with short duration of 6 . Energy demand in a traditional 
duty cycle is therefore successive multiple demands of energy packets. 

Fig. 1^ illustrates the concept of energy packets where 
continuous demand of electricity is quantized into multiple 
successive demands. The unit of demand and supply for 
electricity is therefore one energy packet. The value of 5 can 
be designed based on the contract between the operator and 
consumers and will affect system performance. This will be 
discussed later. 

In PDLC, group of flexible appliances of each consumer 
is connected to the central controller operated by the smart 
building operator (SBO) through a local area network where 
bi-directional communication/control is established such that: 

(i) appliances can send to the SBO their instantaneous state 
information signal that represents their desired comfort level or 
preferences of electricity consumption, and (ii) the SBO will 
send a binary control signal that executes direct control of the 
on/off switch of each appliance. In the discrete time control 
system, 5 is both the duration of the energy packet and the 
decision interval. The communication from the appliances to 
the SBO indicates their energy needs for the next interval. 
Different choices of communication protocols can be built on 
top of the PDLC framework so that consumers can send var¬ 
ious types of information based on their willingness to share 
information. An ideal communication signal from consumers 
contains the state of both the appliance and the environment 
regulated by the appliance. For example, an air conditioning 
appliance can report both its operating status as well as the 
desired temperature set point. As an alternative communication 
protocol, the appliances can choose to send binary information 
to the SBO stating whether or not they wish to consume a 15- 
packet of electricity. This scenario corresponds to consumers 


wishing to protect their privacy or to a building automation 
monitoring system that is not reliable. The control from the 
SBO, which is executed at the beginning of each interval, is the 
decision on energy packets authorization to all appliances for 
the following one interval based on the received information. 
An appliance is allowed to consume one energy packet if it 
receives the authorization, and it needs to send signals in future 
intervals if additional packets are needed. The following few 
points comprise the remaining background of PDLC. 

(1) The PDLC targets at controlling duty cycle appliances 
with thermal storage that includes air conditioners, water 
heaters, etc. Since our thermal models duty cycle appliances 
are all governed by first order ordinary differential equations, 
results in the rest of the paper focusing on the control of air 
conditioning units can be easily extended to alternative control 
objectives with minor changes. 

(2) We assume duty cycle appliances have binary on/off 
operation modes. In the on mode, an appliance can only 
operate at its rated power and consume one unit of energy 
packet given the authorization from the SBO. In the off mode, 
an appliance consumes zero energy. There is no intermediate 
operation mode where consumers can flexibly adjust the power 
consumption of its appliance between the rated power and 
zero. 

(3) Various types of appliances are separated by different 
feeders that are connected to the SBO. Appliances belonging 
to the same feeder have the same rated power and therefore 
consume the same unit of energy packets. The control of 
electricity consumption in the smart building is based on the 
parallel PDLC of each feeder. For inelastic and uncontrol¬ 
lable appliances, we assume they are connected to alternative 
feeders that are excluded from the analysis of the PDLC 
framework. 

V. Hierarchy oe Ineormation Communication 

We would like to characterize the best performance un¬ 
der two different levels of communication scenarios. The 
performance measure should cover both the system aspect 
and consumer aspect. For the system aspect, the controlled 
output from the SBO, which is the aggregated electricity 
consumption, would like to follow the purchased amount of 
energy packets with minimum deviation. For consumer aspect, 
the SBO should provide energy packets to guarantee that 
consumers’ comfort be maintained within the desired range. In 
the following, we first discuss the baseline control performance 
where full state information is transmitted from appliances to 
the SBO. The second part will consider constrained commu¬ 
nication where limited binary information is transmitted. 

A. Full Information Communication 

In the first communication scenario, we assume that sensors 
are installed around air conditioning units and real time 
data/information is sent to the SBO at a frequency higher than 
5 through an erasure free communication channel based on the 
local area network. At time t, the information transmitted from 
consumer i is a paired value of temperature and operating 
status {T*(f),0(l)} where 0 (1) means appliance is at duty 
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off (on) cycle. The objective of the SBO is to determine and 
reserve the minimum needed number of energy packets m 
from the day ahead market to serve appliances within the 
building such that all consumers’ room temperatures can be 
controlled within their preferred range. To quantify consumers’ 
preferences of comfort, each consumer i is only required to 
provide two values to specify his/her preference of comfort 
temperature control; a preferred set point value and a 
preferred maximum deviation value A*. The control system 
performance is therefore evaluated by (i) consumption de¬ 
viation away from the reserved m energy packets and (ii) 
consumers’ dissatisfaction if the room temperature cannot be 
controlled within + A®] at some time. 

Based on the real time T(t) collected from all appliances, 
the SBO can predict the evolution of T{t) for the next 6 time 
interval based on the dynamic model 


dt 


T{t) 


Tout - T{t) - TgU + w{t) 

T 


( 1 ) 


where Tout is the outside temperature, Tg is the temperature 
gain when the AC operates, t is the effective thermal time 
constant, w{t) is bounded measurement error or stochastic 
disturbance, and it is a binary variable specifying the on/off 
operating status, as well as the instant temperature information 
of all consumers. This full information enables the SBO to: 
1) allocate packets to pre-cool a room even if energy is 
not needed to remain in the comfort zone; 2) withdraw a 
packet from rooms having high temperature but still capable 
of acting as heat absorbing sources before temperatures reach 
the maximum comfort threshold. 

We show in 16^ that the minimum number of energy 
packets m that need to be purchased at any time t can be 
determined by the total number of consumers N, that con¬ 
sumers’ preferred set points, and properties of the appliance 


NTout-EtiT. 


m = 


set 


( 2 ) 


In addition, with proper choice of energy packet length 6, the 
smart building can consume exactly m energy packets at any 
decision interval such that no electricity consumption deviation 
would occur while maintaining the room temperature of all 
consumers to lie within their designated comfort bands. The 
result is formally stated as follows: 

Proposition 1 Assuming that r*(/*) S — T^^^+A''] 

for all i and the SBO purchases a fixed number m = {NTout — 
Tl^^)/Tg of packets, then there exists a positive <5 such 
that with a fixed number of m packets allocated starting 
from time t*, the PDLC can control all consumers’ room 
temperatures to be in [T^^t ~ ^\T^et + for all t > t*. 

To this end, it can be seen that m in (|^ is the critical amount 
of energy that needs to be reserved in full information PDLC. 
The control performance based on full information is perfect 
in the sense that no aggregated consumption deviation from 
m and consumer’s comfort loss occurs. 


B. Binary Information Communication 

In the binary communication setup, we fix the value of m 
as in Q and derive the control performance for this protocol. 


When the SBO cannot acquire real time full information 
from appliances due to consumer privacy concerns, inaccu¬ 
rate temperature monitoring, etc, we consider a constrained 
communication that the appliances only signal the SBO binary 
information regarding their need of energy packets for the next 
interval. Namely the appliance will send a request signal to the 
SBO if it wishes to consume energy or a relinquish signal if it 
does not need energy for the next interval. The information 
loss to the SBO is the T*(f) that was used to construct 
the thermal dynamics in Q. The SBO, after receiving all 
requests during each interval, will authorize energy packets to 
a maximum number of m for the next interval based on first- 
in-first-serve queuing principle. For requests received after the 
first m, the SBO will hold them and activate authorization in 
the future intervals. 

Since the SBO does not have real time temperature infor¬ 
mation, it models the energy request (arrival) and withdrawal 
(departure) process based on the duty cycle of appliances. 
Assuming the energy packet request rate from an idle AC is A 
(which is related to duty cycle off time as 1/A), the withdrawal 
rate from an operating AC is p, (which is related to duty cycle 
on time as 1 /p), and the energy packet length is 6, the binary 
information based control system can be described as a closed 
loop queuing network with N appliances and m servers. As 
such, there is an associated probability distribution pin, 5) 
for the number of consuming appliances n = 0,..., m in 
steady state. This distribution is determined by the steady state 
probability distribution of the number of appliances p{x, S) for 
X = 0,..., N in the queue. 

When X appliances are in the queue at time t, k = 
min(a;, m) servers are operating to serve the appliances with 
packet duration S. All k appliances will finish the energy 
packet [/, f -f (5] by the end of the interval and they will 
independently decide whether to request additional packets. 
With probability p{6) = e~^^, an appliance will request an 
additional packet and there is no departure in this case. With 
probability 1 —p{S) = 1 — an appliance will switch into 
the idle state that results in one departure. Since the probability 
of departure linearly increases with the departure rate for small 
value of 6, the departure rate for an appliance is (1 —p{S))/6. 
Therefore the departure rate with k operating appliances is 
k\l —p{5)]/5. In addition, the arrival rate that packet requests 
are received from idle appliances is {N — x)X. We can solve 
for the steady state probability distribution p{x, 6) based on 
the departure/arrival rate 

p{x, 6){N - x)X = p{x + 1, - p{5)]/5, 

where k = mm{x,m). This yields 

where r{S) = steady state probability distri¬ 

bution can be gotten after normalizing p{x,6). 

p{x, 6) can be used to quantify the performance of the 
closed loop queuing network that includes the mean queue 
length, the average service (waiting pluses operating) time. 
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etc. We define the number of operating appliances as 


p{n,S) 


p{n^S), n = 0,...,m—1 

N 

Pik,S), n = m 

k—m 


(3) 


Then the first component of the system performance, which is 
the uncertainty of energy packet consumption is the variance 
of the random variable n based on the distribution in (0. 
The measure of consumers’ comfort loss can be calculated 
based on the total waiting time for consumers to complete 
their energy needs. The average number of consumers in the 
queue is 

N 

Q{m,S) ='^p{x,6)x. 

x—0 


The average arrival rate is 


Aave = A(iV - Q{m, S)). 


Based on Little’s Law, the average time consumers spent in 
the queue, namely the service time, is 


S{m, 6) 


Q{rn, 6) 

X{N - Q{m,S)) 


A. Energy Function Metric 

The energy metric optimization is based on the intuition 
that a large number of reserved energy packets will result in 
unnecessary excess capacity, while a small number of energy 
packets will result in undesirable capacity deficiency. The 
optimal decision must strike the balance between the two costs. 
Based on the steady state probability distribution of consumers 
in the queue p{x) for the system procuring m energy packets 
in the day ahead market, we define the capacity deficiency as 

N 

De{m) = p{x)[x — m). 

And we define the excess capacity as 

m— 1 

Ex{m) = p{x){m — x). 

X—0 

Since the units of both the excess capacity and the capacity 
deficiency are energy packets, we propose that the optimal 
energy packet procurement m* can be gotten by minimizing 
the summed cost of the two metrics. Denote T{m) as the total 
performance degradation, the SBO can determine the optimal 
energy procurement m* based on energy metric minimization 


Since the uncontrolled duty cycle expected operating time is 
l//i, the extra time that consumer’s spent in the system is 

W{m,5) = S{m,5) -. (4) 

P 

W (to, 5) characterizes the binary information based control 
performance for consumer utility. The system measure of 
performance is the uncertainty (variance) of the distribution 
Var(p(n, (5)). We showed in 1631 that there is a trade off 
between W{m,5) and Var(p(n, i5)) as i5 is varied. 

Proposition 2 As A increases, Var(n(TO, <5)) will decrease 
and W(m,S) will increase. 

Remark 1. Proposition 1 and Proposition 2 provide the 
quantification of the difference in system performance when 
information is degraded from full to binary level. We see costs 
exist for both consumers (extra waiting time W{m,6)) and 
the overall system (demand uncertainty Var(p(n, A))). In the 
following, we will transform these two quantifications into 
common measuring units and propose a corresponding energy 
reservation strategy for the optimal value of to. 

VI. Metrics of System Pereormance 

When procuring the same energy packet level at to. Section 
[V] shows that both performance in terms of consumption 
uncertainty and consumers’ dis-utility will degrade when the 
information level is degraded from continuous to binary. It 
is clear that procuring to energy packets is optimal for the 
SBO having continuous state information. We next find the 
optimal energy procurement strategy when the SBO only has 
access to binary information. In what follows, we will provide 
two metrics that will be used in optimizing to. For notational 
simplicity, the packet duration 6 will be fixed based on the 
contract established between the SBO and consumers and will 
be omitted. 


min E(to) = Ex{m) + De{m). (5) 

7n^[l,N] 

We show that E(to) is a convex function guaranteeing that the 
optimal TO* is unique. 

Proposition 3 E(to) is a convex function of to. 

Proof. We prove by showing that both the excess capacity 
Ex{m) and the capacity deficiency De{m) are convex func¬ 
tions of TO. The relation between the excess capacity Ex{m) 
and the system throughput Th{m) is 

[to — Ex(rn)]p, = Th{m). 

It has been shown that in a closed queuing network Th{m) 
is a concave function of to IMl . and therefore Ex{m) is a 
convex function of to. To see the convexity of De{m), we 
have 

N 

De{m) = p{x)(x — m) 

x—m-\-l 

N m (6) 

= ^ p{x){x — to) — ^ p{x){x — to) 

x—0 x—0 

= Q(m) — TO + Ex{m) 

where Q{m) is the mean number of consumers in the queue. 
The steady state arrival rate of the closed queue is product of 
the single arrival rate and the mean number of idle appliances: 
X[N—Q{m)]. The steady state departure rate is the throughput 
of the system: p,[m — Ex{m)]. Since the steady state arrival 
and departure rates are equal to each other 

A[A^ — Q{m)] = p,[m — Ex(m)], (7) 

we have 

Q(m) = N — ^[m — Ex{m)]. (8) 

A 

Substituting (j^ into we get 

De{m) = N — {1 + f^)[m — Ex{m)] 

A 
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is a convex function of m. □ 

If we express both Ex{m) and De{m) as a function of the 
mean queue length Q{m), then E(m) becomes 

E(to) = (1 + 2A//i)Q(m) + m — 2A/P-/V. (9) 


Hence the optimal m* is chosen to satisfy 




1 

l + 2A//r' 


Remark 2. The energy metric optimization is not restricted 
to the form in Q. In fact we can add weight coefficient on both 
Ex{m) and De{m). One reasonable weight on Ex{m) is the 
day-ahead energy costs penalizing unnecessary procurement, 
and weight on De{m) is the predicted real time energy costs 
penalizing the balancing purchase caused by deficiency. 


B. Welfare Function Metric 

The concept of capacity dehciency and excess capacity can 
be translated to derive a welfare metric based optimization 
- (i) capacity deficiency results in a dis-continuity of elec¬ 
tricity consumption and therefore results in room temperature 
deviations from the allowable range, and (ii) excess capacity 
incurs unnecessary energy reserve costs. These two costs are 
quantihed as follows. 

The capacity dehciency characterizes the extra waiting time 
consumers need to spend in the queuing system to complete 
the energy packets service for one duty cycle. Based on (|^ 
and (j^, De{m) is related to Q{m) by 

De{m) = (1 + —)Q{m) — —N. 


Therefore the consumer’s extra time spent in the system, 
W (to) in Q, can be derived from Little’s Law: 


W{m) 


De{m) + i 

X[N — De(m)] fi 


It can be shown that Wfm) < 0, W"(rn) > 0, and therefore 
W{m) is a decreasing convex function. When the comfort 
bandwidth is relatively small compared to the distance be¬ 
tween indoor and outdoor temperatures, the room temperature 
deviation AT(to) drifts linearly with the consumers’ waiting 
time. Hence 

AT{m) = KW{m), 


where temperature drift rate k can be determined by the duty 
off cycle. The welfare characterization of consumer’s dis¬ 
utility g{-) can be dehned as a function of the temperature de¬ 
viation AT(to) as g{A{m)). If the utility function g{AT{m)) 
is convex and non-decreasing w.r.t AT(to), as assumed in 
ll65l . ll6^ . g{AT{m)) is then a monotonically decreasing and 
convex function. This can be easily checked by the first and 
second order conditions 

and 

a^p(Ar(TO)) = g^4^g(AT(TO))AT'(TO)2+ 

^^^^g{AT{m))AT"{m) > 0 . 


Hence g{AT{m)) will be used to translate capacity dehciency 
in energy metric optimization into consumers’ utility in wel¬ 
fare optimization. 

Next we translate excess capacity into the welfare metric 
which can be done straightforwardly as follows. The penalty 
h{-) for having of excess capacity can be dehned as a linear 
function of Ex{m) where the coefficient can be either the en¬ 
ergy purchasing price or the energy reserving price depending 
on the contract between the SBO and the ISO. Given Ex{m) 
is convex, h{-) is also convex. To this end, the monetary metric 
optimization solves for to* such that 

TO* = arg min g(AT{m)) + h(Ex(m)). 

Similar to the energy metric optimization, we can transform 
both AT(to) and Ex{m) to Q{m). The unique welfare based 
solution can be gotten by minimizing the welfare metric W(to) 
dehned as 

W(m) = - i)) ) |JV - 0(m)]). 

( 10 ) 


VH. Impact of Renewable Energy 

When the SBO procures day-ahead energy packets from 
traditional resources, there is no volatility in terms of the actual 
amount of energy packets that the SBO can get. Therefore 
the day-ahead energy procurement strategy is to purchase to* 
minimizing the energy metric Q or the welfare perspective 
(lOl. However, when the SBO purchases energy with a com¬ 


bination of deterministic (traditional) and volatile (renewable) 
resources, the actual number of available packets as well as 
the real time performance will be stochastic that depends on 
the probability distribution of the volatile resources. In the 
following, we analyze the impact of volatile energy on the 
system performance. 

We begin by modeling the distribution of volatile resources 
and focus on wind generation. We use a Gaussian as the wind 
prediction error model. This is typically used for short term 
wind prediction for time scales between 6 and 48 hours ahead 
of operating time Ell, ED- Suppose the SBO has access to 
wind generation resources, and suppose the realization of wind 
energy satisfies the following Gaussian distribution with 
prediction mean Pr and variance tr^ 


P,^N{Pr,a^). 


In addition to wind energy, the SBO can choose to purchase a 
certain amount of traditional energy denoted as Pt. This yields 
the following probability distribution of the number of packets 
TO that will be available in real time 


m = P, + Pt^ N{Pr + Pt,a^), (11) 

and yields the following expected system welfare 'W{Pr, Pt, <j) 
dehned in ( [TOl 

W{Pr,Pt,cr) = J p{m,Pr,Pt,(j)W{m)dm (12) 

where p(rn,Pr,Pt,er) is the probability that the SBO will 
have TO number of available packets governed by the Gaussian 










distribution in •dD- Similar to the uniqueness of m* when 
the SBO procures solely traditional energy, we prove that the 
optimal procurement Pt is unique when the SBO is provided 
with the wind resources distribution. 

Proposition 4 For given {Pr,a} characterizing the wind 
energy, there is a unique Pt that minimizes W{Pr, Pt,o-)- 
Proof Since m ~ N{Pt + Pr,<j‘^), we have 

/ I (m-Pr-PtP 

e 2 ^;^ W(m)fim. 

V27rCT^ 

Letting M = {m — Pr — Pt)/a- he a change of variable 


Letting m = 2P — M and using the fact that a Gaussian 
distribution is symmetric around P: p{m, P,ai) = p{2P — 
m, P, tJi) we have 

+ 00 _ 

J p{m,P^ai)W{m)dm 

P-\-k 

—oo _ _ _ _ 

= / p{2P - M,P,ai)W{2P - M)d{2P - M), 

P-k 

P—k 

= J p{M,P,ai)Wi2P - M)dM. 

— OO 

(17) 

Similarly, 


1 f m2 

W(P^,Pt) = / e-^W{Pr + Pt + Mcr)dM. (13) 

v27rtT2 J 

Based on Leibniz integral rule, the second order derivative of 
the cost function w.r.t Pt is 

+ « + (M) 

which is strictly positive due to the convexity of W( ). Hence 
'W{Pr, Pt, O') is a convex function of Pt for volatile features 
{Pr, cr}, and we have a unique optimal Pt. □ 

We proceed to analyze the impact of wind energy on the 
optimal system welfare. We hrst discuss the scenario when Pr 
and a are uncorrelated. The result is shown in Proposition 3 
that increased uncertainty a will decrease the welfare. This 
result is then extended to wind output having correlated mean 
and variance where the variance linearly increases with the 
mean a = kPr- This corresponds to the case where the wind 
farm output is the summation of small wind mills locating at 
the same spot. 

Proposition 5 Dehne 


P+k P 

/ p{m,P,ai)'W{m)dm = / p{m,P,ai)'W{2P—m)dm. 


p-k 


(18) 


Substituting (17 1 and (18 1 into (16 1 we have 


p-k 


W(P, (Ti) = / p(to, P, cri)[W(TO) + W(2P — TO)](im+ 


p _ 

f p(m,P,ai)[W(m)+W(2P — m)]dm. 

P-k 

(19) 

Similarly we can have the expression of W(P, ( 72 ) which 
together with ( [T9| ) is used to derive 

W(P,ai) - W(P,(J2) 

p-k 

= / [p(m, P, CTi) — p(m, P, (T2)][W(m) + W(2P — m)]dm+ 

— OO 

P _ _ 

J [p{m, P, CTi) — p{m, P, (T 2 )[W(m) + W(2P — m)]dm. 

p-k 

( 20 ) 

For any mi, m 2 such that mi < P — k < m 2 < P, we have 


F(P,,(t) =minW(P,,Pt,a) (15) 

Pt 

then P(-,-) increases as a increases. 

Proof. Consider two levels of volatile resource uncertainty 
(Ti < tT 2 - Denote Pj^ as the optimal solution for {Pr-,cr 2 }, 
and denote P = P^ + Pt 2 as the mean number of packets 
expected in real time. We compare the value of W(P, CTi) and 
W(P,a2). 

The probability distribution of the number of packets in the 
two scenarios are mi ~ N[P, af) and m 2 N{P, o"!)- Based 
on the symmetric property of Gaussian distributions, there is 
a positive value of k such that 

f p{m,P,ai) < p{m,P,a 2 ), if |m - P| > fc 
1^ p{m,P,ai) > p{m,P,a 2 ), otherwise. 

We have 

W(P,ai) 

P—k _ +00 _ 

= f p(m, P, ai)W(m)dm + f p(m, P, cri)W(m)dm 
-00 P+k 

P _ P+k 

f p(m, P, ai)W(m)dm + f p(m, P, ai)W(m)dm. 
P-k P 

( 16 ) 


[W(mi) + W(2P - mi)] - [W(m 2 ) + W(2P - m 2 )] 

= ]W(mi) - W(m 2 )] + JW(2P - mi) - W(2P - m 2 )] 

> '7F)^[W(mi)-W(P)] + ^^™]W(2P-mi)-W(P)] 

^fF)^[W(mi)-2W(P)+W(2P-mi)]>0. (21) 

The inequality is derived based on the Jensen’s inequality on 
W(-). Let Cl = inf |w(m)+W(2P—m)lm G (— 00 , P—fc]| 

and C 2 = sup |w(m) +W(2P —m)lm G [P — fc, P]|. From 
we have Ci > C 2 . Therefore we have for ( |20l l 

W(P,ai)-W(P,a2) 

p-k 

< Cl J p(m, P, ai) — p(m, P,cr 2 )dm+ 

— OO 

P _ 

C 2 J p(m, P, fxi) - p{m, P, a 2 )dm, 

p-k 

P—k 

= (Cl - C 2 ) / p{m, P,ai) - p{m,P,a 2 )dm < 0 . 

— OO 

The hrst inequality is derived since p{m, P, cri) < p{m, P, 0 - 2 ) 
for m G (—oo,P — k) and p{m,P,ai) > p(m, P, (T 2 ) for 
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m G {P — k, P). The second equality is derived from the 
property of Gaussian distributions 


P(to, P,(Tl) - p{m, P(72) 


dm = 0. 


Hence the system costs will increase when the uncertainty 
increases from cti to (T 2 


W(P„Pt,,ai) < W(P,,Pt,,(T2). (22) 


Denoting Pt^ as the minimizer for wind uncertainty level ai, 
we have 


W{Pr,Pt„ai) <W{Pr,Pt„ai) <W{Pr,Pt„a2). 


VIII. Optimal Energy Procurement 

We discuss the optimal energy purchasing strategy for 
the SBO in different market settings. We begin by focusing 
on single market participation in Section |VIII-A| where the 
SBO only participates in the day-ahead energy market by 
purchasing certain amounts of traditional and wind energy 
at given day ahead prices. On the next day, it will utilize 
what has been purchased. This analysis is then extended in 
|VIII-B| when the SBO can participate in both day-ahead and 
real time market. The SBO exersices the same choices in 
the day-ahead market as before. In addition, the SBO can 
choose to procure additional balancing energy or sell day- 
ahead purchased energy back to the market based on the 
realization of wind output. 


This ends the proof. □ 

We extend the analysis to the scenario when the wind 
output mean and variance correlate. Suppose a wind farm is 
composed of many wind turbines having relatively small value 
of output with Gaussian distributions 

6r ~ N{5m,Crj). 

Then the scaled output of the wind farm with n wind turbines 
is 

n6r N(ji6 m j (ncrs)'^), 


Clearly the standard deviation of the aggregated output a = 
nas scales with mean output Pr = nSm with constant coef¬ 
ficient of variation k = crsISm- The more expected resources 
the SBO wishes to reserve, the more uncertainty it will face, 
and hence the system costs. This is formally stated as follows. 

Corollary 1 If the wind output has correlated mean and 
variance satisfying 

P^ ^ N{Pr,{kPrf), 


then the system costs F{Pr,kPr) in will increase as P^ 
increases. 

Proof. Consider two levels of wind output < Tr- 2 ^ 
and denote Pt^ as the optimal traditional energy procure¬ 
ment under Pr^. From (22i we know that we can carefully 


choose Pt = Pt 2 
, kPr-. 


Pr 2 — Pri such that W(Pri, kPr^ , 


Pt) < 

W(P,„fcP,,,PiJ. Denoting Pj^ as the optimal solution under 
Pr^, we will have 


W{Pr„kPr„Pt,)<W{Pr„ 


kPr^ , Pt) < W {Pr2 , kPr2 , T’f 2 ) • 


This ends the proof. □ 

Proposition 5 and Corollary 1 indicate that the system 
welfare cost will increase as either the mean or the variance of 
wind resources increases. The introduction of wind resources 
does not bring benefits to real time operation. However, wind 
energy is helpful from the day-ahead point of view by reducing 
the costs of purchasing Pt that is needed otherwise. This yields 
a trade-off where we need to jointly minimize the day ahead 
energy procurement and real time operating welfare costs. In 
the following section, we discuss the optimal energy purchase 
strategy in both day-ahead and real time markets. 


A. Single Market Participation 

Suppose the SBO participates only in the day-ahead market 
and plans to optimally procure a certain number of energy 
packets from both traditional and volatile resources such that 
it minimizes the summed costs of the energy purchase and real 
time welfare costs W(Pt, Pr, cr). Here W(Pt, Pr, cr) is defined 
by Q with W(m) = g{n{ x{S{m)) - f)) containing 
only tne consumer’s waiting time to access energy packets. 
In the day ahead, the unit traditional energy price is known 
to the SBO as kt- We consider the following two statistical 
characterizations of wind energy availability. 

The first model characterises wind availability as a process 
with a fixed mean Pr and standard deviation a that cannot be 
chosen by the SBO. For instance, the SBO can locally connect 
to a wind farm within its microgrid whose statistical output is 
determined by its location, time, etc. The SBO’s objective is 
to minimize over Pt while having fixed Pr and tr: 


min ktPt+ yf{Pr,Pt,cr). 
Pt 


(23) 


Clearly (23 i is a convex function given the second term is 


convex as shown in Prop. 4. It is straightforward to find the 
unique optimal Pt. 

In the second model the wind output has correlated mean 
and variance due to the aggregation of many small windmills. 
The wind output Py has the same distribution as in Corollary 
1 

Pr^iV(Pr,(fcPr)2). 


The SBO can flexibly sign contracts for outputs of certain 
number of wind turbines in order to get wind energy in real 
time. The SBO would solve over Pt and Pr to minimize the 
total operation costs. (We omit the third argument in W(-) 
since the variance depends on the mean.) 

min ktPt + kyPy -f W(Pr, Pt). (24) 

Pt,Pr 

In ( [24| , ky is the unit reservation price for wind energy 
(satisfying kr kt) that is used to cover the low operation 
costs and capacity costs of wind turbines. Proposition 6 proves 
that W(Pt,Pr,fcPr) is jointly convex in {Pt,Pr}, and this 
guarantees a unique procurement solution. 

Proposition 6 W(Pt, Pr) is jointly convex of Pt, Pr. 
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Proof. From we have 

OO 

W{Pt,Pr) = f p(m,Pt,Pr)W(m)dm, 


— OO 
OO 


1 =e 2 (kr,y^ W(m)dm, 


f y'27rfc2p2 ' 

OO 2 

= vb / e-^W(Pt + (l + kM)Pr)dM. 

— OO 

The last equation is gotten by a change of variable M = 
. We prove the Hessian of W{Pt,Pr) is positive 


kP, 

definite 


H{W){Pt,Pr) = 


JlLw 


aPtdP, 


■W 


a^ 


aPraPt 

Based on Leibniz integral rule 

a2 - ~ ^ 


W 


JL_ 


W 


0 . 


/ e-^W"(P* + (l + fcM)P,)dM, 

* —OO 

2 - ^ 

/ e-^(l + fcM)2w"(Pt + (l + fcM)P.)dM, 

^ — OO 

2 - °° 

/ e-^(l + fcM)W"(P* + (l + fcM)P,)dM. 


(25) 


Therefore based on the Cauchy-Schwarz inequality and (25 1 , 
the determinant of the Hessian satisfies 

8'^ - 8'^ - r)2 _ 


(9P2 dp2 


'dPrdPt 
dP? 

W{Pt,Pr) is jointly convex of Pt,Pr- 


o2 

In addition > 0, so the Hessian is positive definite. 

□ 


B. Double Markets Participation 

The single market operation is a straightforward one step 
optimization problem. In a more realistic model, the SBO 
participates in both the day-ahead and real time market to 


procure energy packets. As in Section VIII-A the day ahead 
market allows the SBO to reserve a certain number of energy 
packets Pt with purchasing unit cost of kf When the actual 
wind resource Py is revealed, the SBO can either sell back or 
purchase additional energy packets based on its need. If it sells 
back energy packets, it will be credited with jkt per unit where 
7 is the discount factor for diminished value of energy. If the 
SBO wishes to procure additional balancing energy, it will be 
purchased with higher unit price kb > kf This is unknown 
in the day-ahead market, but the probability distribution p{kb) 
can be modelled and is known to the SBO based on data 
history. 

The double market participation is a two stage decision 
problem which can be solved backwards from the second 
to the first stage. In the second stage, namely the real time 
operation, the optimal strategy is to choose Xi amount of 
packets from day-ahead reservation and from balancing 
energy to solve 

min kbX 2 -^kt{Pt - xi)+'S^{xi+X 2 +Py) 

(26) 

S.t. Xi < Pt, Xi,X2 > 0 

The second term is the credit the SBO will receive by selling 
Pt — Xi energy packets back to the market. Denote the optimal 


value of p6| ), namely the real time optimal operation costs, as 
R{Pt, Py, kb), which is a function of day ahead purchase Pt, 
and the stochastic price kb and wind output Py. The first stage, 
namely the day ahead optimal reservation for Pt to hedge price 
and resource uncertainty, is to solve 

min ktPt+ E [R{Pt, Py, kb)]. (27) 

Pt>0 P„,fci 

In ( p7] ) we assume that the SBO cannot optimize over Py, 
this assumption will be relaxed in what follows. From stan¬ 
dard sensitivity analysis, we note that for a fixed pair of 
{Py,kb}, R{Pt, Py,kb) is a convex function of Pt. Therefore 

E [R{Pt, Py, kb)], which is the expected value of the op- 

Pv,kh 

timal real time operating cost overs price uncertainty kb and 
resource uncertainty Py, is also convex. A first necessary order 
condition requires 

kt + 777 E [R(Pt, Py, kb)] = 0. 

dJp'f 

Since the uncertainty is independent of Pt, we solve the 
equivalent problem 



The Lagrangian function of the real time problem is 

L{xi,X2,p.) = kbX2 - "fktiPt - a;i) + W(xi +X2 + Py) 
+pL{xi - Pt). 

Denoting p*{Pt, Py,kb) as the dual variable associated with 
the constraint Xi < Pt at optimality, then 

f—R{Pt,Py,kt,h) = -jPt - p*{Pt,Pv,kb). 
dPt 

Therefore the optimality condition is equivalent to 


(l-7)fct- E [p*{PfPy,kb)]=0. (28) 

Py,kb 


Problem (28 1 can be solved iteratively by stochastic approxi¬ 
mation with proper step size a{t) 1691 . Proposition 7 formally 
establishes the almost sure convergence of Pt to the optimum. 
Proposition 7 Denote P^ as the optimal solution to (27 1 , 


and {Py{i),kb{i)} as randomly generated sample uncertainty 
in the i—th iteration. If we update Pt according to 


Pt{i -f 1) = Pt{i) - a(*)[(l - l)kt - fJ.*{Pt{i),Pv{i),kb{i))] 

where a{i) = 1/i and p*{Pt{i),Py{i),kb{i)) is the dual 
variable for constraint Xi < Pt in gotten at optimality, 
then Pt{i) converges to P^ with probability 1 as i —b oo. 
Proof. The proof is similar to Section 4.3^.5 in l69l . □ 

A key assumption to guarantee the almost sure convergence 
is that the uncertainty in the second stage is independent of 
the decision in the first stage. The two stage problem becomes 
more difficult when the SBO can reserve and optimize over 
both Pt and Py. In this case, the uncertainty Py revealed in 
the second stage is a decision dependent variable based on Py. 
For the day ahead decision, we solve 

min ktPt + kyPy + E [R{Pt, Py, kb)] 

Pt,Pr>0 Py,kb 

S.t. Py ~ N{Py, {kPy)^) 


( 29 ) 
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where is the unit cost for reserving wind energy, and Py 
is a Gaussian random variable depending on the decision 
Py. The optimality condition for Pt is the same as in ( p8] l. 
The optimality condition for Py can be derived by taking the 


derivative of the objective function in (291 


d 


■ + 75 ^ E 

Ory 


- ky -\- 


fcr + P / R(Pt, Pv, kb) yj^p{Pv\Py)dPv^ 

fe. + p[ / RiPt,Pv,kb)^i^i^0^ - 

fcf, Lp "T \ rj J 

P ^ [r(P„ Py , - 1)]. 






Py 


P ^ 


1 .Pv{Pv — Py) 


{kPy)^ 

For notational simplicity, we let 

1 


- l)p(P„|P,). 


fiPv,Pr) = 


Pr 


f PyjPy - Py) _ ,x 


Proof. It is equivalent to prove that E [R(Pt, Py, ^b)] is a 

Py,kb 

convex function of Pt {Py) while the other variable is fixed. 
Note that 


E R{Pt,Py,kb) 

Py,kbl 


E 

kb - 


E 

kb - 


J R(Pt, Py, kb)p(Pv\Py)dPi 
f R(Pt,(l + kM)Py,kb)e-'^dM 


(33) 


(30) 


The second equality is gotten since the conditional distribution 
is p{Py\Py) = e ~^, and therefore 


and substitute f{Py,Py) into (30i to get the optimality con¬ 
dition for Py 

ky+ E [R{Pt,Py,kb)f{Py,Py)] ^0. 

Pv,kb 

Hence the day-ahead procurement can be gotten by jointly 
solving I 


where the second equation is gotten by a change of variable 
M = ■ It is left to prove that R(Pt, (1 -I- kM)Py, kb) is 

convex. In what follow, we prove the convexity of ( [33] ) w.r.t 
Py. The proof for Pt follows similarly. 

Note that the balancing energy price is strictly greater than 
day-ahead energy price, i.e. kb > p. Therefore the SBO will 
have only three purchasing options (i) purchase neither xi,X 2 , 
(ii) purchase X 2 = 0 and xi < Pt, and (iii) purchase xi = Pt 
and a ;2 > 0. The optimal real time operation cost will have 
the corresponding structure: 

R{Pt,il + kM)Py,kb) 
r -7fctPt+W((l + A:M)P,),if (i) 

= < -7fct(Pt - xi)-I-W(a:i-I-(1-I-fcM)Pr), if (ii) 

[ kbX2+yf{Pt + X2 + {1 + kM)Py), if (in) 

(34) 

(32) Namely the real time cost is a piece-wise continuous function 


(31) 


and (28 I. Remember the almost sure convergence 
of the stochastic approximation algorithm requires that the 
uncertainty is independent of the control. This is satisfied for 
Pt which will not affect uncertainties, but is partially satisfied 
for Py which will affect Py. Therefore the stochastic update 
for Pt and Py will not have the same structure, namely in 
updating Py we need to get rid off the affect of Py by taking 
multiple samples. 

We will present three stochastic approximation algorithms 
to solve for the optimal Pt and Py. The first algorithm shares 
the same algorithmic structure as Proposition 7 and tries to 
update Pt and Py separately. For a fixed Py, we update Pt as 
in Proposition 7. For a fixed Pt, we update Py by evaluating 
the objective function’s stochastic gradient with respect to kb 
uncertainty. This algorithm will guarantee to reach optimality 
almost surely, but the execution is slow due to time consuming 
evaluations in updating Py where we need to take multiple 
samples of Py to estimate the true gradient. The second 
algorithm, which simultaneously updates Py and Pt, is fast 
in reaching a neighbourhood of the optimal solution, but it 
will persist to oscillate and not converge. The third algorithm 
is a combination of the first two algorithms. It first uses 
the second algorithm to approach the neighbourhood of the 
optimal solution, and then it switches to the first algorithm to 
guarantee the almost sure convergence. 

Before proceeding to the three algorithms, we first prove 
that the problem defined in ( [29] l has the following property 
that guarantees the uniqueness of the optimal solution. 

Proposition 8 There is a unique local optimal Pt (Py) that 
minimizes ( [29| ), which is also global optimal for a fixed level 
of the other variable. 


having three parts corresponding to (34i. Since W(-) is convex, 
each piece-wise function is convex. In addition, both sub¬ 
gradients exist at the points that joint cases (i), (ii) and cases 
(ii), (iii). Therefore the piecewise function is convex in □ 

Algorithm 1 below is based on Proposition 8 that indicates 
that there is a unique optimal solution to update either Pt or Py 
while having the other fixed. According to Proposition 7, if the 
uncertainty is independent of the decision variable, stochastic 
approximation can be used to find the optimal solution. This 
indicates that we can iteratively update Pt and Py to approach 
the global optimal solution. 

Algorithm 1 

(1) Set initial value of Pt{0) and Py{0). Set j = 0. 

(2) Update Pt for fixed Pr. Let Pf = Pt{j) and for 
i = 0,1,..., M with some large M generate i.i.d samples of 


kl, and Py based on Py{j). Solve (261 to get the dual variable 


fi*{Pl,Py,kl). Update PI according to 

= PI - a,((l - 7)fct - p*{Pl,Pl,kl)). 

Let Pt{j + 1) = P^ after the M—th iteration. 

(3) Update Pr for fixed Pt. Let P° = Py{j) and for 
i = 0,1, ...,M with some large M generate i.i.d samples of 
Solve (26 1 under different Py to get the dual variable R(Pt {j + 
l),Py,kl). Calculate the derivative w.r.t Py by the following 
integration 

J RiPt, Py,kb)f{Py, P^:)piPy\P^)dPy, (35) 


where /(•, •) is defined in (311, and update Pi) according to 

P;+l = P;-a, [fcr+ J RiPt,Py,kb)f{Py,P))p{Py\P))dPy 

Pt, 
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Algorithm 1 



Optimal Pr 

(a) Algorithm 1 favors almost sure convergence. 


Algorithm 2 



38 39 40 41 42 43 44 45 46 

Optimal Pr 


(b) Algorithm 2 favors computational efficiency. 


Algorithm 3 



Optimal Pr 

(c) Algorithm 3 combines the advantages. 

Fig. 4. (a) Almost sure convergence of Pf and Pr generated by stochastic 

approximation Algorithm 1. In each iteration two sub-problems are solved. 
We first fix Pr to get the optimal Pt, and then fix Pt to get the optimal Pr. 
Almost sure convergence to the optimal is guaranteed for both sub-problems. 
In the end global optimality is achieved by iteratively solving the two sub¬ 
problems. (b) Algorithm 2 updates both Pt and Pr at the same iteration 
based on random samples of and Py. In this algorithm the generated 
sequence of P^ will converge to the near optimal solution P^, but P^ will 
persist in oscillating around the optimal solution P* . (c) Algorithm 3 first 
adopts Algorithm 2 to update both Pt and Pr at the same iteration based on 
random samples of kf) and Py until Pt converges to Pt(0) with some Pr(0). 
Using Pt{0) and Pr{0) as a starting point, it then switches to Algorithm 1 
to separately update Pr{i) and Pt{i) one at a time. Within few iterations 
(typically less than 3) of separate updating, both Pt{i) and Pr{i) converge. 


Let Pr{j + 1) = after the M—th iteration. 

(4) Convergence Criterion. If \Pt{j) — Pt{j + 1)\ < e and 


\Pr{j) — Pr{j + ^)\ < e. return the almost sure optimal solution 
Pf = Pt{j + 1) and P* — Pr{j + 1). Otherwise increase j 
by 1 and go to step (2). 


Fig. 4(a) shows the trajectory generated by Algorithm 1. 
With sufficiently large M chosen in step (2) and (3), this 
algorithm will find the optimal solution for Pt and P^.. 
Algorithm 1 will converge almost surely to the global optimal 
P(* and P* since step (2) and step (3) both converge to the 
almost surely optimal solution at the corresponding steps and 
that the problem itself is jointly convex of {Pt,Pr}- 

One issue for Algorithm 1 is the computational effort 
needed in step (3) to accurately calculate the derivative of 
Pr- We need to consider all possible realizations of Py, solve 


(261 for each Py, and integrate ( [35) . The second algorithm 
avoids the explicit calculation of P5). Instead it updates Pt 
and Py at the same iteration based on samples of kb and Py. 

Algorithm 2 

(1) Set initial value of and Py. 

(2) Update Pr and Pt. For i = 0,generate 
and Py based on Py. Solve (26 1 to get R(P(,P^,fc^) and 


fj,*{Pt,Pi,kl). Update Pj and Py 


p:+^ = p:- a. 


(l-J)kt-^i*iP^,Pi,kl) 


p;+i = p; -a, ky+ R(P^ P*, kl)f{Pl,P^) 


Return P* = P^ 


M 


and Py = Py as the near optimal solution. 
Fig. |4(b)| shows the trajectory generated by Algorithm 2 


where the solution converges to the optimal in Pt direction, 
but will persist to oscillate inside a set containing the optimal 
solution Py. Algorithm 2 is fast compared with Algorithm 1 


since each time we solve (26 1 , we get the optimal cost as well 
as the dual variable that are used to update both Pt and Py. 
However, after a certain number of iterations this algorithm 
becomes inefficient since Py will oscillate while Pt will stay 
at the same level. 

The above two algorithms show a trade off between the 
almost sure convergence and the speed (computational effort) 
in calculating the sample derivative of Py. Since the second 
algorithm can successfully drive the trajectory to a region 
that is close to almost sure optimality, we can combine the 
advantages of the two algorithms such that we first speed to 
the neighbourhood of optimality and then switch to Algorithm 
1 to avoid oscillation of Py. This inspires the Algorithm 3 
which is given as follows. 

Algorithm 3 

(1) Run Algorithm 2 until Pt converges. Typically the 
converged value is close to the optimal solution P^. Denote 
the solution by Pt(0) and Pr(0). Set j = 0. 

(2) Run step (3) in Algorithm 1 to update Py{j) to Py{j + 1) 
while fixing Pt{j). 

(3) Run step (2) in Algorithm 1 to update Pt{j) to Pt{j +1) 
while fixing Py{j + 1). 

(4) Check convergence of Py{j + 1) and Pt(j + 1) according 
to convergence criterion (4). If not, increase j by 1 go back 
to step (2). 


Fig. 4(c) shows the trajectory generated by Algorithm 3. 
The initial trajectory is similarly generated as in Algorithm 2. 
When Pt converges, we switch to Algorithm 1 to avoid the 
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oscillation phenomenon observed in Algorithm 2. Since Pt(0) 
gotten at the end of step (1) is very close to the real optimal 
solution P^, we can expect a few iterations between step (2) 
to step (4) before the final convergence. 

IX. Control and Market Aspects for the Binary 
Information PDLC 

We would like address properties of the binary informa¬ 
tion based PDLC framework from both control and market 
perspectives. In terms of control, we seek to understand the 
fundamental trade off between the benefit of providing flexible 
energy provision to consumers and the costs of facing larger 
system instability. In terms of markets, we will address the 
effect of wind energy on market contracts. 


A. Demand Uncertainty vi. Consumer’s Utility 


It is well known that a trade off exists between providing 
consumers with higher flexibility and ensuring a better con¬ 
trolled and predictable system iTOl . For example, in numerous 
papers price based indirect load control frameworks have been 
proposed that allow consumers to independently decide their 
energy preferences based on real time price signals. However, 
few demand response programs nowadays actually exercise 
such practice due to the high unpredictability of the system 
response. DLC mechanisms, on the other hand, restrict the 
allowable actions by consumers to regulate their appliances 
in order to achieve accurate and robust performance of the 
distribution network. In our binary information based PDLC 
framework, there are two parameters governing the trade off 
between flexibility and controllability - the number of reserved 
energy packets m and packet length (decision duration) 5. 
Higher level of m or smaller value of 5 provides flexibility 
as the system is enabled with a higher maximum capacity and 
more frequent decision making possibility 

We numerically illustrate this trade off in Fig. |^for system 
uncertainty Var(n(TO,(5)) and mean waiting time W{m,6). It 
can be seen from Fig. |5(a)| that the variance of the packet con¬ 
sumption is affected by (i) the maximum reservation amount 
of energy packets m and (ii) the duration of packet length 
5. The former has a greater effect. Regardless of S, demand 
uncertainty is static when we either provide absolute freedom 
(with large m) or control (with small m) to consumers. We 
also observe that Var(n(m, (5)) increases linearly with respect 
to values of m around the optimal procurement level. 

The number of reserved packets m also has a greater effects 


on W{m^ (5) as shown in Fig. (5(b) i. With a moderate amount 


of reservation, W{m, 5) can be reduced to a value less than 20 
seconds. Considering that the duty cycle on and off time are 
typically around 10 minutes, delaying 20 seconds will only 
increase the room temperature beyond the boundary by 5% of 
the comfort band width. If additional reduction of W{m, 5) 
is desired, the SBO can choose to have smaller 5 which 
will have larger effect than reserving additional packets. The 
choice of small 5 does have small effect of increasing demand 
uncertainty. 



Roservod Packets 0.5 


(a) Variance of demand reflects the controllability of the 
PDLC. 



Packet Length 


3 


(b) Mean Waiting Time reflects consumers’ dis-comfort of 
having temperature rise beyond the desired range. 

Fig. 5. Trade off between flexibility and controllability. System uncertainty 
would monotonically increase as either we reduce decision making period, 
i.e. reducing packet length <5, or we increase the allowable maximum system 
capacity, i.e. increasing amount of packet reservation m. Consumers flexibility 
will have opposite characteristics to the system uncertainty. 


B. Wind Energy Impacts 

We would like to quantify the value of wind energy in 
market operations. The optimal usage of wind energy is to 
balance between: (i) the benefit it brings to the system welfare 
because of its low operating costs, and (ii) the uncertainty it 
brings to the real time operation such that more operating 
reservations, both traditional and renewable, are needed. One 
important aspect is to And how wind quality will affect the 
optimal procurement of wind energy, and its consequential 
effects on potential contacts between wind producers and the 
SBO. Higher uncertainty in wind availability will discourage 
the demand side from using wind resources in favor of 
continued reliance on traditional energy sources. In addition, 
the operation costs of wind k^. will also have an impact on the 
energy contract between the SBO and the wind farm. 

Table |I] and Table [n| show the optimal energy procurement 
for Pr and Pt as a function of wind quality and cost with 
coefficient of variations k and wind operational costs kr- We 
And a linear dependence between k and the contracted wind 
output Pr where Pj. decreases as k increases, k^ also has a 
minor effect on the optimal Pr as long as kr is relatively lower 
than the costs of day-ahead energy reservation. As expected. 
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TABLE I 

Optimal Contract of the Wind Output as Wind Price or 
Quality Changes 


k kp 

0.02 

0.04 

0.06 

0.08 

0.10 

0.05 

52.66 

52.37 

51.97 

51.52 

50.77 

0.10 

49.88 

49.43 

48.86 

48.11 

47.52 

0.15 

47.05 

46.52 

45.87 

44.91 

43.93 

0.20 

44.41 

43.68 

42.92 

41.97 

40.84 

0.25 

41.75 

41.03 

40.20 

39.17 

38.14 

0.30 

39.52 

38.61 

37.80 

36.86 

35.59 


TABLE II 

Optimal Contract of Traditional Energy as Wind Price or 
Quality Changes 


kp 

0.02 

0.04 

0.06 

0.08 

0.10 

0.05 

3.50 

3.72 

4.14 

4.53 

4.95 

0.10 

7.89 

8.08 

8.86 

9.56 

9.84 

0.15 

12.14 

12.31 

12.84 

13.91 

14.77 

0.20 

15.95 

16.49 

17.09 

17.89 

18.93 

0.25 

20.11 

20.47 

20.90 

21.78 

21.66 

0.30 

22.48 

23.61 

23.48 

24.51 

25.65 


the demand for Pt will increase in a way that is complementary 
to a decreased demand for P^. It is clear that total amount of 
energy reservation, i.e. Pt + Pr will increase as either the wind 
quality deteriorates or wind costs decrease. 

X. Conclusion and Future Path 

Smart buildings account for the greatest share of microgrid 
energy demand. They also provide the greatest opportunity for 
management and control. A new operating protocol, called 
packetized direct load control (PDLC), has been introduced 
and proposed for various communication and control settings 
for the smart building operation. Two levels of possible 
communications are considered that comprise (i) an ideal 
scenario where consumers’ allow the operator to access their 
real time full information with an erasure free channel, and 
(ii) a constrained information exchange scenario where the 
operator has limited access regarding the binary desirability 
of consumers electricity preference. We show the fundamental 
trade off between achieving controllability of the system and 
endowing flexibility to consumers within the PDLC as the 
operator varies the reservation capacity or decision interval 
duration. This trade off is further mathematically defined and 
unified into either an energy metric or a consumer welfare 
metric for optimization purposes. 

Based on these metrics, the concept of PDLC is embedded 
into market settings where we consider the SBO’s participation 
in the cascading of day-ahead and real time markets for opti¬ 
mal energy procurement. We have proposed three algorithms 
that solve the corresponding stochastic approximation problem 
and whose result gives the optimal energy management solu¬ 
tion to purchase a mixed portfolio of traditional and renewable 
(wind) energy. Along the course, we proved the uniqueness 
of the optimal solution and the almost sure convergence of 
our algorithms. The impact of wind energy with respect to its 
quality and costs is numerically addressed. 

We believe that the PDLC framework can be fit into 
broader settings in optimal control of demand response and 
optimal operation of electricity markets. This paper serves 


as a seminal work discussing the role of PDLC operation 
in the energy market. One direct extension is to consider 
the PDLC in reserve markets where the operator can flexibly 
modulate aggregate consumption up and down by rationing 
energy packets based on the needs of consumers. The role of 
information is a critical enabler of optimized performance of 
building energy systems. 

A further extension along the path is to explore the pos¬ 
sibility of using energy packets to reach higher operational 
granularity that is not necessarily restricted within the direct 
load control framework. Price based protocols would easily 
be adapted where consumers can potentially sign packet- 
based contracts containing various choices of packet duration 
depending on their beliefs on prices and their need for energy. 
Similar to the above, packet contract duration and allowable 
reservation capacity will play a role in balancing system 
controllability and consumer flexibility. 
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